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ABSTRACT 

We explore the near-infrared (NIR) A-band properties of galaxies within 93 galaxy clusters and groups 
using data from the Two Micron All Sky Survey (2MASS). We use X-ray properties of these clusters 
to pinpoint cluster centers and estimate cluster masses. By stacking all these systems, we study the 
shape of the cluster luminosity function and the galaxy distribution within the clusters. We find that 
the galaxy profile is well described by the NFW profile with a concentration parameter c ~ 3, with no 
evidence for cluster mass dependence of the concentration. Using this sample, whose masses span the 
range from 3 x 10 13 A/ Q to 2 x 10 15 M©, we confirm the existence of a tight correlation between total 
galaxy NIR luminosity and cluster binding mass, which indicates that NIR light can serve as a cluster 
mass indicator. From the observed galaxy profile, together with cluster mass profile measurements from 
the literature, we find that the mass-to-light ratio is a weakly decreasing function of cluster radius, 
and that it increases with cluster mass. We also derive the mean number of galaxies within halos of 
a given mass, the halo occupation number. We find that the mean number scales as A tx M 0S4±0M 
for galaxies brighter than Mk = —21, indicating high mass clusters have fewer galaxies per unit mass 
than low mass clusters. Using published observations at high redshift, we show that higher redshift 
clusters have higher mean occupation number than nearby systems of the same mass. By comparing the 
luminosity function and radial distribution of galaxies in low mass and high mass clusters, we show that 
there is a marked decrease in the number density of galaxies fainter than M* as one moves to higher 
mass clusters; in addition, extremely luminous galaxies are more probable in high mass clusters. We 
explore several processes- including tidal interactions and merging- as a way of explaining the variation 
in galaxy population with cluster mass. 

Subject headings: cosmology: observation - galaxies: clusters: general - galaxies: luminosity function, 
mass function - galaxies: formation - infrared: galaxies 



1. INTRODUCTION 

Understanding galaxy formation is one of the most out- 
standing challenges in cosmology. The development of 
both semianalytic (e.g. White & Rees 1978; Kauffmann 
et al. 1993; Cole et al. 2000, among others) and numerical 
(e.g. Kauffmann et al. 1999; Springel et al. 2001) modeling 
have enjoyed tremendous success, in the sense that those 
models are able to match several observed properties, such 
as the galaxy luminosity function (LF), the morphologi- 
cal mix, the color, the mass-to-light ratio of the galaxies 
(Cole et al. 2000). But as the quality of the observational 
constraints improves, we can expect additional theoretical 
challenges (e.g. Benson et al. 2003). 

The clustering properties of dark matter and galaxies 
pose another tough task, for both theorists and observers. 
The recent advent of the so-called halo model has intro- 
duced an important new tool on this subject (e.g. Seljak 
2000; Peacock & Smith 2000; Scoccimarro et al. 2001). 
The essential ingredients of this model include a descrip- 
tion of halo abundance as a function of cosmic time and 
halo mass (the mass function, e.g. Press & Schechter 1974; 
Sheth & Tormen 1999; Jenkins et al. 2001), a model for 
halo structure (e.g. Navarro et al. 1997; Moore et al. 1998), 
and a prescription for the bias of haloes (e.g. Mo & White 



1996; Sheth & Tormen 1999). 

The halo occupation distribution (HOD) is a powerful 
tool which links the physics of galaxy formation with the 
clustering of dark matter and galaxies (Benson et al. 2000; 
Kauffmann et al. 1999; Berlind & Weinberg 2002; Berlind 
et al. 2003; Kravtsov et al. 2003). It assumes that the 
evolution and clustering of haloes are determined by the 
underlying cosmology, and that the physics that governs 
galaxy formation specifies the way galaxies populate the 
haloes. The calculations of various power spectra of dark 
matter and galaxies are a natural outcome. 

The important ingredients within the HOD framework 
are the mean number of galaxies per halo A as a function 
of halo mass, the probability distribution that a halo of 
mass M contains N galaxies P(N\M), and the relative dis- 
tribution (both spatial and velocity) of galaxies and dark 
matter within haloes (Berlind & Weinberg 2002). 

Here we aim to provide observational constraints on the 
HOD, based on our study of 93 clusters and groups using 
data from the Two Micron All-Sky Survey (2MASS, Jar- 
rett et al. 2000). Using X-ray properties of these clusters to 
define the cluster center and estimate the cluster binding 
mass, we determine the mean halo occupation number N 
as a function of mass from ~ 3 x 10 13 M Q to ~ 2 x 1O 15 M 
and also investigate the galaxy distribution and luminos- 
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ity function within the clusters. We discuss the bearing of 
the N-M relation on the hierarchical structure formation 
paradigm. 

Part of our analysis is built on the tools that we de- 
velop in an earlier paper (Lin et al. 2003, hereafter pa- 
per I) , where we examined the near-infrared (NIR) galaxy 
luminosity-cluster binding mass correlation {L-M rela- 
tion) for a sample of 27 nearby clusters, using the sec- 
ond release of the 2MASS data. Here, for a much larger 
sample, we will study the galaxy distribution within the 
clusters and the faint-end shape of the cluster luminos- 
ity function, solve the K s -ba,nd luminosity functions for 
individual clusters, and derive the total light and galaxy 
number within the virial radius as a function of cluster 
binding mass. Our findings provide some constraints on 
cluster evolution scenarios. 

In §2 we briefly describe the technique developed in pa- 
per I for examination of the cluster NIR L-M relation. 
Next in §3 we begin our analysis with two fundamental 
properties of galaxy clusters: the luminosity function (LF) 
and the spatial distribution of the member galaxies. With 
constraints on galaxy distributions both in real space and 
in luminosity space derived directly from the data, we pro- 
ceed to calculate the L-M relation, point out the impor- 
tance of the contributions from the brightest cluster galax- 
ies (BCGs) (§4.1), and examine the mean halo occupation 
number (§4.2). We investigate the possible mechanisms 
that arc responsible for the observed behavior of the halo 
occupation distribution in §5. Possible systematics that 
may affect our results are discussed in §6. Finally, in §7, 
we summarize our results. The Appendix provides some 
further tests of the robustness of our analysis. 

Throughout the paper we assume the density parame- 
ters for the matter and the cosmological constant to be 
= 0.3, Oa = 0.7, respectively, and the Hubble param- 
eter to be H = 70h 70 km s _1 Mpc -1 . 

2. ANALYSIS OVERVIEW 

Our analysis begins with identifying a sample of clus- 
ters which have reliable mass and cluster center estimates. 
Based on this information we extract galaxies from the 
2MASS extended source catalog that lie within the virial 
radius for each cluster. The number and light of mem- 
ber galaxies in each cluster are estimated in a statistical 
sense. We first stack the radial distributions of galaxies to 
study the galaxy distribution. We then build a composite 
luminosity function from all clusters in the sample, which 
allows us to examine the faint end slope of the luminos- 
ity function (LF). With this information, we can derive 
the luminosity functions of individual clusters in a self- 
contained way. Equipped with these tools, we are able to 
address issues relating to the halo occupation distribution 
(§4), and the possible transformation of galaxy popula- 
tions as low mass clusters accrete, merge and grow into 
high mass clusters (§5). 

X-ray observations provide robust estimates of cluster 
binding mass and the position of the center. In particu- 
lar, the tight correlation between emission-weighted mean 
X-ray temperature Tx of the intracluster medium (ICM) 
and cluster total mass indicates that Tx serves as a good 
proxy for mass (e.g. Evrard et al. 1996; Finoguenov et al. 
2001). We therefore assemble our sample from several ex- 



isting X-ray cluster samples (Mohr et al. 1999; Reiprich & 
Bohringcr 2002: Finoguenov et al. 2001; Sanderson et al. 
2003; David et al. 1993; O'Hara et al. 2003), and use mea- 
sured Tx (i.e. not inferred from X-ray luminosity or other 
observables) to estimate cluster mass. The redshift infor- 
mation is obtained from NED and/or SIMBAD, and the 
above catalogs. The emission-weighted mean X-ray tem- 
perature is also taken from the literature cited above. The 
X-ray centers are either estimated from archival ROSAT 
images, or taken from literature (Reiprich & Bohringcr 
2002; Bohringer et al. 2000; Ebeling et al. 1996, 1998). 

We only consider systems which do not have nearby 
neighbors: we eliminate the systems that have clusters or 
groups with measured redshifts (i.e. confirmed identity) 
lying within a circle of 2 ~ 3 virial radii projected on the 
sky. We further remove the systems for which the signal- 
to-noise is too low, i.e. the ratio of the number of galaxies 
within the virial radius to the estimated statistical back- 
ground/foreground galaxy number (see below). Finally, 
to reduce any confusion from stellar objects in the 2MASS 
catalog, we select systems at galactic latitude |6| > 10°. 
With these criteria we end up with a sample of 93 clusters 
and groups (hereafter we often refer to all systems in our 
sample as clusters, for simplicity). The X-ray tempera- 
ture in our sample ranges from 0.8 to <~ 12 keV, with a 
redshift range 0.01 < z < 0.09. 

Given Tx of the clusters, we use the mass-temperature 
(M-Tx) relation provided by Finoguenov et al. (2001) 

1 58 +0 ' 06 

to obtain M500 = (47r/3)500p c r| 00 , the mass enclosed by 
rsoo, within which the mean overdensity is 500 times of the 
critical density of the universe p c . The virial radius r 2 oo is 
converted from r5oo using the profile proposed by Navarro 
et al. (1997, hereafter NFW) with concentration Q m = 5 
(e.g. van der Marel et al. 2000; Biviano & Girardi 2003). 
We then search the 2MASS all-sky release archive to col- 
lect galaxies that lie within r2oo and are brighter than 
the completeness limit K s ^u m — 13.5 (hereafter we de- 
note K s as K for simplicity). We use the "20 mag/square 
arcsec isophotal fiducial elliptical aperture magnitude" of 
the 2MASS final release. We estimate the total magni- 
tudes by subtracting 0.2 mag from the isophotal magni- 
tudes (Kochanek et al. 2003). The number and bright- 
nesses of foreground/background galaxies that lie within 
the cluster virial region is estimated from the log N — log S 
relation derived from the 2MASS all-sky data (T. Jarrett 
2003, private communication). In addition to the Pois- 
son uncertainty, the contribution from background galaxy 
clustering is also included in our error analysis in § 4 (e.g. 
Peebles 1980, see the Appendix); although this term is ig- 
nored in §§ 3 & 5, it does not affect our analysis or results 
(see the discussion in §3.2). The background subtracted 
galaxy number and luminosity, N Q b s & L b s , enable us to 
solve for the cluster LF, which is assumed to be of the 
Schechter (1976) form: 

^ L)dL = ^{i~y ^ L/L ' d {i~)' (2) 

where the parameters X*, 4>* an d ct are the character- 
istic luminosity and number densities, and the faint-end 
power law index, respectively. We seek constraints on a 
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directly from our data by stacking the luminosity func- 
tions of galaxies in all 93 clusters (§3.2). The remaining 
two parameters are then determined from the two observ- 
ables. We note that the BCGs usually do not conform to 
the cluster LF, and we treat them separately when deter- 
mining the number count and luminosity required to solve 
for the LF parameters. We identify BCGs as the bright- 
est galaxies within the cluster virial radius, and we obtain 
their rcdshifts from NED to ensure cluster membership. A 
further investigation on the BCG properties will be pre- 
sented in a separate paper (Lin and Mohr 2004, in prepa- 
ration, hereafter paper III). Once L» & 4>* are found, we 
integrate the LF to obtain the total luminosity and num- 
ber of galaxies (now including the BCGs) to a limiting 
magnitude Mk,Iow = —21. Our approach is described in 
greater detail in §2 of paper I. 

3. BASIC PROPERTIES OF STACKED CLUSTERS 

In this section we first study the galaxy distribution in 
clusters by their surface density profile (§3.1), then in §3.2 
examine the composite cluster LF (especially the faint-end 
behavior). These are needed for the next section, where 
we solve for the LF for each cluster and find the expected 
galaxy (subhalo) number. 

3.1. Galaxy Surface Density Profile 

An important aspect of the HOD formalism is the galaxy 
distribution within dark matter haloes. Different distribu- 
tions will result in different clustering statistics (e.g. two- 
point correlation function) at small scales (Seljak 2000; 
Peacock & Smith 2000; Berlind & Weinberg 2002). Here 
we study the galaxy distribution within our cluster sample 
by stacking the projected galaxy distributions. 

Large iV-body simulations suggest that dark matter 
has a "universal" density profile (NFW, Navarro et al. 
2003). The profile is characterized by a scale radius 
r s = r2oo/cdrm where Cdm is the concentration parameter. 
The overall normalization of the density profile is charac- 
terized by the halo formation epoch; when the normaliza- 
tion is scaled out, haloes of different mass are expected to 
have the same structure. We therefore stack the clusters 
with radial distance scaled by their virial radii r2oo- 

Because of the different nature of galaxies and dark 
matter, it is not necessary that both follow the same dis- 
tribution. We express the 3D number density profile as 
n(x) — nox _1 (l + x)~ 2 where no is the normalization, 
and x — c g r/r 2oo- Note that we allow the galaxies to 
have concentrations different from that of dark matter. 
The surface density is then an integral of the 3D profile 
£(#) = 2riQr s J^ 2 cos 6>(cos 8 + x)~ 2 d9 (see e.g. Bartel- 
mann 1996). 

We assume the observed surface density is composed of 
the projection of the cluster profile and a constant back- 
ground. The best-fit surface density profile is determined 
by a maximum likelihood method. Specifically, we mini- 
mize the quantity 

-log£ = -^og(^|^ -logG(N obs ,N tot ), (3) 

where N b s is the total number of galaxies in the stacked 
cluster, Xi is the radial distance of the i-th galaxy, N to t is 



the model prediction of the total number of galaxies (mem- 
ber galaxy number N c u plus background) as a function of 
no & c 9 , and G is a Gaussian with mean of Ntot and stan- 
dard deviation \fN~tot. In the fitting routine we allow the 
parameters N c i s and c g to vary, while N to t is constrained 
by the Gaussian kernel to reproduce the observed total 
galaxy number N b s - 

Fig 1 shows the radial distribution of galaxies in the 
stacked cluster from ~ 0.02 r 2 oo out to 2.5r 2 oo, as well as 
the best-fit NFW profile (which is obtained using the data 
out to r2oo only). Out to the virial radius, the stacked 
cluster contains 6608 galaxies (the BCGs are excluded), of 
which 1467 are estimated to be background. It is interest- 
ing to see how well the simple NFW model fit works out to 
r > r 200, given that we only use a statistical background 
subtraction method, and the fact that we do not pre-select 
the clusters to be circular/regular in shape or in dynamical 
equilibrium. The galaxy concentration is c g = 2.90^022' 
where the uncertainties are determined from the relation 
Ax 2 = — 2 A log £ with two degrees of freedom. We also 
notice the unit of the surface density is "number per virial 
area" . 
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Fig. 1. — Composite galaxy surface density with BCGs excluded. 
The best-fit NFW profile is shown. The fit is obtained by fitting 
the data within r2oo on ly- The figure shows the distribution of 6608 
galaxies, of which 1467 (22%) are estimated to be background. The 
lower panel shows the residuals (differences between data and the 
fit) in units of standard deviation, which does not include the con- 
tribution from galaxy clustering. 

The figure shows that the NFW model is a reasonable 
description of the galaxy distribution out to large radius. 
This is consistent with the analyses of 14 CNOC clusters 
at intermediate redshifts (z ~ 0.3, Carlberg et al. 1997; 
van der Marel et al. 2000), who found that c g = 3.7 & 4.2, 
respectively. In particular, in comparison to the galaxy 
surface density fit of the latter (see their equation 2), our 
nearby 2MASS clusters seem to have a slightly broader dis- 
tribution. Using clusters drawn from the ENACS, Adami 
et al. (1998, 2001) find that a profile with a core is generally 
preferred over a cuspy profile. Although the distribution 
of fainter (e.g. Mk > —22.5 mag) galaxies in our sample 
indeed can be well described by /3-models, the distribution 
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of all (or brighter) galaxies is better fit by cuspy profiles 
such as the NFW or a generalized-NFW profile (of the 
form n(x) oc x~ a (l + x) a ~ 3 with 1 < a < 2, e.g. van der 
Marel et al. 2000). In fact, a generalized-NFW profile with 
a = 1.07, c g — 2.71 provides a good fit to the total galaxy 
distribution as well. 

We note that the concentration parameter we obtain is 
the galaxy number weighted average over all clusters. As 
shown in §5.2, the galaxy concentration in high and low 
mass clusters is similar, and so the value obtained from 
all clusters is representative. The concentration c g we find 
lies at the lower bound of measured dark matter concen- 
trations Cdm ; there have been many attempts to study the 
matter distribution within clusters, via analysis of galaxy 
dynamics (e.g. van der Marel et al. 2000; Biviano & Girardi 
2003; Katgert et al. 2004, Cdm ~ 4 — 5), X-ray emission 
from the intracluster medium (e.g. Pratt & Arnaud 2002; 
Lewis et al. 2003, Cdm ~ 4 — 5), weak/strong lensing (e.g. 
Clowe & Schneider 2002; Arabadjis et al. 2002, c dm ~ 3-8) 
or the caustic approach (e.g. Geller et al. 1999; Biviano & 
Girardi 2003; Rines et al. 2003, Cdm ~ 4 — 17). In general 
these findings are in agreements with the direct numeri- 
cal simulations of the dark matter clustering (e.g. NFW, 
Bullock et al. 2001). 

We can also compare our result with the numerical stud- 
ies in which distinct subhaloes are resolved. Combin- 
ing high-resolution numerical simulation and semianalytic 
galaxy formation model, Springel et al. (2001) find that 
galaxy distribution in their simulated Coma-like cluster is 
not the same as that of the dark matter. More recently, a 
study of subhalo properties within a large cluster ensem- 
ble shows that the subhaloes are less concentrated than 
the mass (De Lucia et al. 2004). 

In the following sections, we take c g = 3 when converting 
the number of galaxies contained in the cylindrical volume 
to that in the virial sphere, necessary when we solve for 
LFs for individual clusters (§4.1). On the other hand, we 
use Cdm = 5 for conversion between different cluster radii 
(e.g. rsoo to T2oo)- We return to the surface density profile 
in §5.2 for a subset of our sample. 

3.2. Cluster Luminosity Function 

We stack the clusters in luminosity space to generate a 
composite LF as follows. For each cluster, we transform 
the apparent magnitudes of all galaxies within the clus- 
ter virial radius r2oo to absolute magnitudes, assuming all 
galaxies are at the cluster redshift. Galaxy magnitudes are 
then binned with 0.25 mag width. The faintest bin corre- 
sponds to the bin whose faint edge is just above (brighter 
than) the absolute magnitude of the 2MASS completeness 
limit at the cluster redshift. The logiV — \ogS relation 
is translated to absolute magnitude for statistical back- 
ground subtraction. We then apply the k-correction of 
the form k(z) — — 61og(l + z), following Kochanek et al. 
(2001). The estimated number of cluster member galaxies 
is divided by the cluster volume (adjusted from a sphere 
to a cylinder to take the projection effect into account, see 
§3.1 above). 

The Schechter function generally underestimates the 
abundance of very bright galaxies, which is mainly due 
to the presence of the BCGs (e.g. Schechter 1976; Goto 
et al. 2002; Christlein & Zabludoff 2003). Their luminos- 



ity seems to be drawn from a distribution different from 
the usual cluster LF. We treat these objects separately 
when solving the cluster LF for individual clusters, and 
also do not include them when stacking the LFs. We dis- 
cuss the effects of the BCGs on the light -mass relation in 
§4.1. 




Fig. 2. — 1-C-band composite cluster luminosity function with 
BCGs excluded. The solid line shows a Schechter function with 
a = -0.84, M Kt , - 51og 10 ?»7o = -24.02 and </>„ = 4.43/i 70 
Mpc -3 , and the dashed line shows a fit with a fixed at —1.1, 
M K „ - 51og 10 h 70 = -24.34 and <f>, = 3.01/i 70 Mpc -3 . The uncer- 
tainties do not include the contribution from galaxy clustering. 

Figure 2 shows the composite LF, which is comprised 
of 5932 galaxies, of which 1256 are estimated to be back- 
ground. Although the bright end shows the expected ex- 
ponential drop, the scatter at the faint-end needs further 
discussion. Statistical background subtraction is adequate 
when a bin contains contributions from several clusters 
from different areas on the sky. However, we notice that 
only a couple of clusters contribute to the faintest bins, 
and this together with the growing background correction 
may explain the larger scatter. Nevertheless, we expect the 
behavior of the composite LF to represent the underlying 
true cluster LF at Mk— 5 log 10 /170 < —21. In Fig 2 two fits 
are shown (obtained via chi-square fitting): the dashed- 
line (a = -0.84±0.02, M K „-51og 10 h 70 = -24.02 ±0.02, 
<f>* = 4.43 ± O.ll/i^o Mpc -3 ) is the best- fit, which seems 
to fit most of the bins at the bright end (we notice that 
the small number of galaxies in the faintest bins makes 
their statistical weight small). On the other hand, the 
solid line, which is obtained by fixing a at —1.1 while let- 
ting the other two parameters vary, may reflect the gradual 
rise at the faint end more faithfully (Mk* = — 24.34±0.01, 
0» = 3.01 ± 0.04 Mpc -3 ). Although it is reasonable to as- 
sume the true value of a lies within these two values, in the 
absence of deeper photometry (with depth to Mk = — 19 
or so) or an even larger sample of nearby clusters, we will 
assume a = —1.1, the same value we use in paper I, in the 
following sections. We note that the error bars in Fig 2 do 
not include the contribution from galaxy clustering, and 
therefore underestimate the true uncertainty at the faint- 
end; our choice of a = —1.1 is therefore roughly consistent 
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with the data. In §6 we will test the sensitivity of our 
results to the choice of a. 

It is important to compare our results to previous stud- 
ies: Balogh et al. (2001) use the second incremental release 
data from 2MASS to build a composite LF for clusters, 
based on 274 cluster galaxies with redshift measurements 
from the Las Companas Redshift Survey. Despite the large 
scatter due to the small number of galaxies in their sample, 
they find that M K * = -24.58 ± 0.40 and a = -1.3 ± 0.43. 
The agreement in Mk * is encouraging, while the difficulty 
in determining a is illustrated. Our Mk* is also close to 
the value in more distant clusters: Mk* = —24.27 ± 0.49 
for the z = 0.15 bin in the sample of De Propris et al. 
(1999, note that they assume a = —0.9). Using a sample 
of 5 clusters which span a large range in redshift, Tren- 
tham & Mobasher (1998) estimate that a = -1.38 ± 0.24 
in iif-band and find the slope is not a strong function of 
redshift. A study of a z = 0.3 cluster in X-band finds 
a = —1.18, and the exact value may depend on the lo- 
cation within the cluster (Andreon 2001). In ii-band the 
existing studies show similar values for a (Andreon & Pello 
2000; Tustin et al. 2001, but see De Propris et al. 1998). 

In optical bands there are many systematic studies of 
the cluster LF. For example, the 2dF collaboration stud- 
ies a sample of 60 clusters in bj band and finds that the 
composite LF is characterized by a = —1.28 (De Propris 
et al. 2003). This study finds that there is little variation 
in the LF between low and high mass clusters (see §5.1). 
Using the SDSS commissioning data Goto et al. (2002) 
study a large sample of clusters and find that the slope 
becomes less steep in redder bands (see De Propris et al. 
2003; Goto et al. 2002 for comparisons between various 
studies). 

We can also compare the cluster LF with the "field" LF. 
From 4192 galaxies with redshift measurements, Kochanek 
et al. (2001) find that M K * = -24.16 ± 0.05 and a = 
-1.09 ± 0.06, using the 2MASS data. Recently, by com- 
bining the 2MASS and SDSS EDR photometry, Bell et al. 
(2003) find that a = -0.77 and M K * = -24.06. At 
slightly higher redshift (0.1 < z < 0.3), Feulner et al. 
(2003) find M K * = -24.56 ±0.24, and a = -1.1. Overall, 
the field and the cluster LF are similar, which is in agree- 
ment with the findings of previous studies in red bands 
(e.g. Trentham & Mobasher 1998; De Propris et al. 1998; 
Christlein & Zabludoff 2003; Andreon 2003). 

The cluster LF within a smaller area (rsoo) is very sim- 
ilar to that obtained at r200j with a higher signal-to- noise 
ratio with respect to the background; the LF is com- 
posed of 3785 galaxies, of which 549 are estimated to 
be background. The best-fit Schechter parameters are 
a = -0.83 ± 0.02, M K * - 51og 10 /i 70 = -24.02 ± 0.03, 
4>* — 10.06 ± 0.29/jyo Mpc~ 3 ; these are almost identical to 
that obtained at r2oo , with a larger (f>* . This larger value 
is expected because, by definition, the region enclosed by 
rsoo is denser than that within r2oo- Furthermore, the 
ratio 0*,5oo/^*,2oo = 2.27 is intriguing: a galaxy distribu- 
tion with c g = c dm = 5 gives 0*,5oo/0*,2oo = 2.5, while 
a c g — 3, Cdm = 5 model gives a ratio of 2.3, which is in 
better agreement with the above value. This is another 
indication that the galaxies follow an NFW distribution, 
but have a smaller concentration compared to the dark 
matter. 



4. STATISTICAL PROPERTIES OF CLUSTER ENSEMBLE 

Using the composite LF and galaxy distribution in our 
data, we calculate the total light for each cluster and ex- 
amine the L-M relation, paying particular attention to 
the BCGs, which we neglect when creating the composite 
cluster LF and galaxy distribution. We show that exclu- 
sion of the light contributions from the BCGs significantly 
changes the L-M relation. Specifically, we will see that 
without these BCGs, the galaxy light in clusters reflects 
the galaxy number. We discuss possible implications of 
this finding for cluster formation scenarios, and we further 
examine constraints on these scenarios in the next section. 
Table 1 contains parameters for the scaling relations pre- 
sented in this section, derived at both rsoo & ?"200 ■ I n Table 
2 we present the derived quantities (Schechter function pa- 
rameters, total light and galaxy number) for the clusters 
in our sample. 

4.1. Luminosity - Mass Correlation and the Effects of 
the Brightest Cluster Galaxies 

kT x (keV) 
0.8 1 2 4 6 8 10 

I 1 I 1 I 1 I I 1 I 1 I 1 




M (h _1 10 14 M ) 

Fig. 3. — i^-band luminosity— mass correlation within rsoo- The 
best-fit relation has a slope of 0.69 ± 0.04. The scatter about the 
best-fit is 32%. For most of the clusters the uncertainties in light 
is smaller than the size of the points. For clarity we do not show 
the uncertainty in cluster mass (see Fig 5). At the top is the X— ray 
temperature, from which M500 is estimated (see Eqn 1). 

We show in Fig 3 the correlation for the ensemble of 93 
clusters. The systems in our sample, which span one and 
a half orders of magnitude in mass, show striking regular- 
ity in their total NIR galaxy light. The figure also shows 
the best-fit relation, which is obtained by minimizing the 
vertical distance from the data points to the line: 



10 12 h^L Q ' ' \2 X 10 u h^M Q J 

(4) 

where the uncertainty is obtained from bootstrap resam- 
pling and refitting 1000 times. The Spearman correlation 
coefficient is 0.86, with a probability of 0(1O -28 ) that such 
a correlation happens by chance. As in paper I, we obtain 
the total light by integrating the LF of individual clusters 
from a limiting luminosity corresponding to Mi ow — — 21. 
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Changing this luminosity does not affect the slope we find. 
However, we caution that more massive clusters tend to 
be more distant (because our sample is roughly X-ray- 
selected), integrating their LFs requires more extrapola- 
tion than low mass systems. We discuss possible system- 
atics associated with this, and the choice of the faint-end 
slope of the LF in §6. 

Our result confirms the existence of the NIR L-M re- 
lation we found in paper I, and the slopes are consistent: 
based on 27 clusters, using the second incremental release 
data we found L 500 cx MgoQ 9±0 ' 09 . We notice that the 
similarity between the slopes found is a coincidence: the 
inclusion of the k-correction, which is ignored in paper 
I, reduces the slope. Furthermore, the amplitude of the 
current L-M relation is ~ 20% smaller than that found 
previously. This is primarily due to the inclusion of the 
fc-correction (12%) and the different galaxy radial profile 
concentration (6%). 

The rms scatter about the best-fit scaling relation 
(Eqn 4) is ~ 32%. We note that because the total light 
depends on the cluster mass, the scatter may well be a 
mix of some intrinsic scatter in the if-band light and that 
due to the X-ray M-Tx relation. In order to estimate 
the intrinsic scatter in the AT-band light, we examine the 
light contained within a fixed metric radius of 0.75 Mpc 
(instead of fixed overdensities) , as this way the quantities 
in the two axes are independent. In addition to luminosity 
uncertainties, we add a component af nt that represents the 
intrinsic scatter in the \ 2 fitting. The best-estimate of <Ji n t 
is such that the reduced x 2 = 1. We find this indicates 
that 24% scatter in the L-M relation would be intrinsic. 
This impressive regularity, combined with the wide range 
in mass over which the light correlates with mass, shows 
the potential use of the galaxy NIR light as a proxy for 
cluster binding mass (to an accuracy < 43%), which will 
be very useful in optical/NIR cluster surveys. We further 
investigate this aspect in §7. 



The L-M relation measured within r2oo is 
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Fig. 4. — Total iC-band galaxy light within a fixed metric radius 
of 0.75 Mpc. The correlation indicates that 24% scatter in the L-M 
relations (e.g. evaluated at fixed overdensities) is intrinsic. 
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= 5.64±0.16 



M, 



200 



2.7 x 10 



14 



h 70 Mq 



0.72±0.04 



(5) 

where the normalization mass is the M200 corresponding 
to that used in Eqn 4. In obtaining this, we solve for indi- 
vidual cluster LFs at r^oo- The individual L*s are not very 
different from those solved at r5oo; the mean of the ratio 
of the two quantities for all the clusters is £*,5oo/-k*,200 = 
1.07. However, as we note in §3.2, the characteristic den- 
sity is very different. We have 0*,5oo/0*,2oo = 2.22, which 
is very close to the ratio obtained from the fits to the com- 
posite LFs. Furthermore, the ratio of the mass-to-light ra- 
tios evaluated at the two radii is T500/T200 = 1-04. This 
means that the cluster mass-to-light profile is only slightly 
decreased toward larger radii, which is consistent with 
some previous findings (e.g. Rines et al. 2001; Katgert 
et al. 2004; see Biviano & Girardi 2003 for a review). An 
equivalent way of stating this is that light increases only a 
bit faster with radius than the mass does (our results imply 
that £200/^500 = 1-43, compared to M200/-M500 = 1.38 for 
an NFW profile with a m = 5). 
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Fig. 5. — ii"-band mass— to-light ratio within r200- The solid line 
shows the best-fit relation, which has slope 0.26 ±0.04. The dashed 
line shows the universal mass-to— light ratio, derived using the !!m 
measured by WMAP and the luminosity density estimated by Bell 
et al. (2003). The dotted lines show the uncertainties in T un { v . 

An immediate implication of the L-M relations found 
here is that the cluster mass-to-light ratio is an increasing 
function of mass. Figure 5 shows the mass-to-light ratio 
calculated at r2oo, "f^oo- ^ e best-fit relation is almost 
identical to that inferred from Eqn 5. The uncertainties 
in T 2 oo is obtained by summing in quadrature the con- 
tributions from the cluster radius, total light, the dark 
matter and galaxy concentrations. Our finding of an in- 
creasing mass-to-light ratio is consistent with other stud- 
ies (in i^-band: paper I; in optical: Girardi et al. 2002; 
Bahcall & Comerford 2002, and in group scales: Pisani 
et al. 2003; Tully 2003). It is interesting to compare 
the cluster mass-to-light ratio with that of the universe 
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line in the figure), where we take the measurement of Qm 
from WMAP (Bennett et al. 2003), and use the mean lu- 
minosity density j measured by Bell et al. (2003). The 
mass dependence of cluster mass-to-light ratio makes it 
less secure as an indicator of the matter density parame- 
ter, unless the relative distribution of light and matter is 
taken into account (Ostriker et al. 2003). 

The existence of the L-M relation (or an increasing 
mass-to-light ratio) is intriguing in two ways. First, it in- 
dicates that the star or galaxy formation process in clus- 
ters and groups is regular. It is thus relatively easy to 
predict the stellar mass in a cluster, once a mean stellar 
mass-to-light ratio is known. (Please see §4.1 in paper I 
for further discussion.) Second, the slope of the scaling de- 
serves more attention; because the slope is less than unity, 
high mass clusters seem to produce light (stars) less ef- 
ficiently than low mass clusters. In paper I we attribute 
this to a possible change in star formation efficiency across 
this mass range. Although this is one viable cause, we will 
seek other possible explanations within the context of the 
HOD. For now we remind the reader that we are only ac- 
counting for light within galaxies. 

The brightest cluster galaxies are among the most lu- 
minous objects in the universe. Their contribution to the 
cluster luminosity is not negligible (e.g. Oemler 1976; Gon- 
zalez et al. 2000). We have identified the BCGs in the 
clusters in our sample and estimate their luminosity. In- 
terestingly we find that, although the distribution of the 
BCG luminosity L\> cg is relatively narrow (in terms of K- 
band magnitude, the mean is Mk = —26.18 ± 0.05 mag, 
with a 0.45 mag scatter), there exists a tight correlation 
between L\, cg and the total luminosity L500 (which is the 
sum of the luminosity from all galaxies). These results 
will be presented elsewhere (paper III). The most impor- 
tant implication from this investigation is that the BCG 
contribution to the total light is a crucial factor in shaping 
the L-M correlation. Because a larger proportion of the 
light in group scale systems comes from the BCG, exclu- 
sion of Lbcg steepens the slope of the light-mass relation. 
Denoting the total light from all galaxies but the BCG as 



-^500 — -^500 



Lbcg, it is found that L| 00 cx M®qq 2 - a 3cr 



change in slope! 

The BCGs are the main cause of a flatter L-M relation. 
We turn to the halo occupation number now to further 
examine the origin of the L s -M relation. 

4.2. Halo Occupation Number 

We examine the halo occupation number using the 
galaxy LF within each cluster. Once the Schechter pa- 
rameters (/>» & L,,, are determined for each cluster (with- 
out the BCG), the total number of galaxies brighter than 
a luminosity threshold Li ow is simply N = 1 + N s , 
where the first term denotes the BCG count, and N s = 
V(f)* Iy^ oui y ae ~ vd y^ where yi ow = Li ow /L* and V is the 
cluster volume. 

Excluding the BCGs results in a L-M relation at r$oo of 
slope 0.82 (c.f. Eqn 4, Table 1). This demonstrates the im- 
portance of light from BCGs to the total luminosity bud- 
get. However, unlike the total luminosity, the mean num- 
ber of galaxies should be much less sensitive to the absence 
of BCGs, since they only stand for one count per clus- 
ter. Including the BCGs, we find that N 50Q oc M 5 ° g 2±0 04 ; 



without the BCGs, the resulting halo occupation number 
is (Fig 6) 

M50Q 



m m {M K < -21) = 56 ±2 



2 x 10 14 M Q 



0.84±0.04 



(6) 

and indeed only small change in slope is found. The super- 
script emphasizes this number does not count the BCGs. 
We note that the rms scatter about the best-fit line is 
35%, slightly larger than that of the light-mass relation. 
The halo occupation number, as well as other derived LF 
parameters, for each cluster, is listed in Table 2. 

Interestingly, the slope of the N s -M relation is essen- 
tially the same as the BCG-free L s -M relation. This im- 
plies that, except for the BCGs, the sum of galaxy lu- 
minosities scales as total galaxy number in clusters (i.e. 
the shape parameter L» of the luminosity function does 
not change with cluster mass). This is true also for the 
pair of relations evaluated at r 2 or> ^200 00 -^200 2± ° ° 4 ' an d 
iVIoo « ^ 2 oo 7±0 - 04 (Table 1). 

The bottom panel of Fig 6 shows a p , a measure of the 
spread in P(N\M), defined as the ratio of the second mo- 
ment to the first moment of the occupation number 

al^(N(N-l))/(N) 2 
(Berlind et al. 2003; Kravtsov et al. 2003). In the case 
that the distribution is Poisson, a p — 1. We calculate a p 
by averaging the occupation numbers in 7 mass bins, each 
of which has a roughly equal number of clusters. 




10 14 10 15 
M 5oo (h^o 1 M o) 

Fig. 6. — Top: Number of galaxies as a function of cluster mass. 
The best-fit relation gives N oc M0.84±o.04 i Bottom . deviation of 
the halo occupation number from a Poisson distribution. a p = 1 for 
Poisson, while a narrower (broader) distribution has a v < 1(> 1). 

There are several observationally determined results for 
the halo occupation number. Most of these come from 
matching the observed galaxy two-point correlation func- 
tion or power spectrum (e.g. Seljak 2000; Peacock & Smith 
2000; Yang et al. 2003; Zehavi et al. 2003; Magliocchetti 
& Porciani 2003). Generally speaking, this is done by as- 
suming a form of the HOD, adjusting the parameters until 
the prediction from the halo model matches the observed 
galaxy clustering. There are other groups who use differ- 
ent approach to study the HOD. For example, by studying 
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the velocity dispersions of groups in the Updated Zwicky 
Catalog, Pisani et al. (2003) find that N oc M°- 70±0 ' 04 . 
Using the LF from the Nearby Optical Galaxy sample, 
Marinoni & Hudson (2002) obtain the HOD for objects 
with a wide range in mass. An advantage of using the LF 
approach to determine the properties of the HOD is that 
we can measure any moment of the HOD, unlike the ap- 
proach relying on the measurements of galaxy clustering, 
which is sensitive to the first and second moments only. 

Another approach similar to our use of the NIR N— 
M relation is presented in Kochanek et al. (2003, here- 
after K03). They identify clusters in the 2MASS all-sky 
data using a matched filter cluster finding algorithm. They 
also calculate the N-M based on a sample of 84 clusters 
with X-ray temperatures and find that A 666 oc 7 1 ^. 09±017 j 
where A^666 is the number of galaxies brighter than = 
—24.16 within r 2 oo- With the X-ray mass-temperature 
relation M500 oc Tjf adopted in our work, their rela- 

1 ^ O —1— Oil 

tion goes as A 666 oc M 500 ' , which is inconsistent with 
our result. K03 examine the N-M relation using sev- 
eral different mass estimators, and the average slope is 
7 = 1.10 ± 0.09, closer to our results. 

We have considered possible explanations for these dif- 
ferences. A way to bias the slope of the N-M relation is to 
have a mass-dependent systematic in the analysis. In both 
studies a constant faint-end slope a is assumed (although 
the values differ slightly). As shown below in §5.1, there 
is no evidence for variations in a with mass to a depth of 
Mk = — 21. They report the number of galaxies brighter 
than Af_, which they assume is fixed, whereas we report 
the number of galaxies brighter than Mi ow = —21, a rea- 
sonable effective absolute magnitude limit for our cluster 
sample. We fit for M* independently in each cluster, but 
there is no trend for varying M» with mass, so this cannot 
explain the difference. As discussed in paper I, we believe 
that the most important difference between the K03 anal- 
ysis and ours is the cluster selection and mass estimation. 
Specifically, we use the peak in X-ray emission as the clus- 
ter center and the emission-weighted mean temperature 
as the mass estimate. Thus, we examine galaxy properties 
within a region that is defined using X-ray observables. 
K03 find their clusters and estimate galaxy number and 
light using only NIR observables. NIR observables are less 
accurate mass estimators (see Figure 3), introducing ad- 
ditional scatter in the observed galaxy number at a fixed 
mass. Note that the scatter about their N-M relation de- 
rived for systems with X-ray temperature (their eq. 35) 
is approximately 2.5 times larger than the scatter about 
ours. Because of the steepness of the mass function, this 
scatter will tend to move more low mass systems to higher 
estimated mass than high mass systems to lower estimated 
mass; this will generically lead to a systematic bias in the 
slopes of scaling relations. In estimating cluster light or 
galaxy number K03 use a galaxy model with fixed metric 
radius, and then attempt to bootstrap to the correct light 
or number using the properties of the detected clusters. It 
is well known that a matched filter approach works best 
when the target and filter are most similar, and so this ap- 
proach could easily be a source of mass-dependent system- 
atics in their analysis; K03 work hard to tune their method 
using N-body simulations to remove these systematics. We 
now know this approach is likely to be flawed, because our 



Figure 1 shows that the galaxies are distributed in a way 
consistent with an NFW model with concentration c ~ 3, 
which is less concentrated than the observationally esti- 
mated dark matter profiles in clusters (see discussion in 
§3.1). Thus, accurately tuning a cluster finding algorithm 
with simulations requires a galaxy population that has dif- 
ferent properties than the dark matter. 

Most of the above studies and our results show that at 
large halo masses, N s oc M 7 , where 7 < 1. From the 
theoretical view point, this is expected. For example, in 
the standard paradigm of semianalytic models galaxies are 
assumed to form in dark haloes of mass above a thresh- 
old, below which there is not a sufficient amount of gas to 
cool to initiate galaxy formation. At the low mass end of 
the halo population, there are 1 or galaxies per halo, on 
average. As the structure grows hierarchically, the small 
haloes merge and galaxies merge (to the central galaxies of 
the newly formed halos, within a dynamic friction time). 
Although gas in the larger mass haloes still can cool and 
form galaxies, the efficiency of gas cooling is a steeply de- 
creasing function of halo mass (e.g. Cole et al. 2000). In 
this picture, therefore, the mass naturally increases faster 
than the number of galaxies. 

Recent high resolution simulations have confirmed these 
expectations (e.g. Berlind et al. 2003; Kravtsov et al. 
2003). In particular, the latter, a pure A-body simula- 
tion, finds that 7 <~ 1 for subhaloes of different abun- 
dances, as well as at different cosmic epochs. By matching 
the number density of their subhaloes to that of galaxies 
at M r < —18 (roughly corresponding to Mk = —21.3, 
using the field LF from Kochanek et al. 2001), they find 
7 = 0.92 ±0.03, in good agreement with our result at r2oo- 

A slope 7 < 1 indicates that more massive clusters have 
fewer galaxies per unit mass, compared to their lower mass 
counterparts. A naive explanation is that clusters "lose" 
galaxies as they merge toward the high mass end. To be 
more specific, 7 ~ 0.84 implies that a 10 15 M Q cluster has 
~ 32% fewer galaxies than the sum often 10 14 M Q clusters. 
(Of course, a 1O 15 M cluster need not be a direct merger 
remnant of ten 10 14 M Q clusters.) A natural question is 
then, how do the galaxies disappear? An equivalent way 
of thinking about this is to ask: can we make a present 
day 1O 15 M cluster by merging less massive clusters or 
groups we see today? How will the merger process alter 
the population of galaxies within their progenitor haloes? 
We continue in the next section to study the implications 
of the L-M and N-M relations. 

5. GALAXY POPULATION TRANSFORMATION 

Within the hierarchical structure formation scenario, 
high mass clusters are made of low mass clusters through 
the processes of merging and accretion. If nothing de- 
stroys galaxies in the merging process (which is unlikely) 
or through some other process that does not have a de- 
pendence on cluster mass, we would expect N oc M, and 
other properties concerning cluster baryons such as the 
stellar mass fraction and the cluster mass-to-light ratio to 
remain the same in high and low mass clusters. However, 
observations show that this is not the case (e.g. paper 
I, Sanderson et al. 2003). Identifying physical processes 
whose efficiency varies with cluster mass is therefore an 
important step toward understanding the meaning of the 



if -band Properties of Galaxy Clusters 



9 



Table 1 

L — M Correlation & Halo Occupation Number 
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L-M and N-M relations. 

We first list a few candidate processes that may change 
the galaxy populations as clusters merge and grow in mass 
(e.g. Treu et al. 2003): tidal stripping, "galaxy harass- 
ment" (cumulative tidal disruption due to impulsive en- 
counters between galaxies or cluster substructures, Moore 
et al. 1998), dynamical friction, galaxy mergers, ram pres- 
sure stripping, variable star formation efficiency, and the 
ageing of stellar populations. These mechanisms may not 
be independent of each other, but their action can trans- 
form galaxies' internal properties, making them fainter, or 
changing their spatial distribution. In the following sec- 
tions we attempt to use the observational constraints from 
our 2MASS cluster study to gauge the relative importance 
of the mechanisms listed above. 

5.1. Constraints from Luminosity Function 
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Fig. 7. — Comparison of LFs for most and least massive 25 clus- 
ters. The best-fit Schechter functions are shown (solid line: high 
mass cluster LF; dashed line: low mass cluster LF). The faint-end 
slopes are similar (down to the completeness limit Mj( i ow = —21 
of the high mass clsuter LF). The lower panel shows the difference 
between data points with the best-fit LF for low mass clusters. Low 
mass clusters show deficit of extremely bright galaxies, but have 
higher density for galaxies of moderate luminosities (M > M*) com- 
pared to high mass clsuters. 

In Fig 7 we show the composite LFs for the 25 most 
and least massive clusters. The mean (median) masses 



M500 for the two subsamples are 5.3(4.7) x 10 14 M Q and 
7.9(8.0) x 10 13 M Q , respectively. Because the high mass 
clusters in our sample tend to be at higher redshifts, their 
LF is only sampled down to Mk ~ —21, while the LF 
of the low mass clusters is probed about one magnitude 
deeper. For meaningful comparison, we only consider 
down to Mk = — 21. 

The best-fit to the high mass cluster LF (hereafter 
HMLF) is a = -0.84 ± 0.03, M* = -24.10 ± 0.04 & <j>* = 
4.00 ± 0.16 Mpc~ 3 , while that for the low mass cluster LF 
(hereafter LMLF) is a = -0.81±0.04, M* = -23.94±0.06 
= 5.34 ± 0.33 Mpc~ 3 . We note that in both LFs the 
BCGs are excluded. From the figure it is seen that the low 
mass clusters lack extremely bright galaxies (therefore the 
fainter M„) but are more abundant in faint galaxies (thus 
the larger </>*). To make this clearer we plot in the lower 
panel of the figure the deviation of the low and high mass 
cluster LFs from the best-fit LMLF, in units of standard 
deviation. At Mk > —23.5 the LMLF points are higher 
than the HMLF, and at Mk < — 26 there are no galaxies 
in low mass clusters. 

The mechanisms that build the cluster halo occupation 
distribution must also be responsible for transformation 
of the LFs. One naive way to convert the LMLF to the 
HMLF is to reduce the abundance of galaxies fainter than 
M* in low mass clusters, and make more bright galaxies 
that are of comparable luminosity as the BCGs in low mass 
clusters. Tidal stripping, ram pressure stripping, galaxy 
harassment, differences in star formation efficiency, and 
stellar ageing, can be responsible for the first step, while 
dynamical friction and mergers between galaxies may act 
to complete the second step. We examine their effects in 
more detail below. 

Because of the tidal field of the host halo, subhaloes or- 
biting within a cluster will lose their mass continuously 
(e.g. Merritt 1984; Klypin et al. 1999; Hayashi et al. 2003; 
De Lucia et al. 2004). Utilizing the expressions in Klypin 
et al. (1999), we estimate the average fractional mass loss 
of a galactic halo of mass m within a cluster of mass M to 
be 

Am /" r " Am 

(m,M)= / (m,M,r)£.(r)4 : iTr 2 dr, 

m Jq m 

where r is the orbit radius, r v is the cluster virial ra- 
dius, and £{r), the likelihood of galaxy distribution within 
the cluster, is taken to be an NFW profile with c g = 3. 
For 10 11 < m < 10 13 M Q , we find that Am(m,M)/m ~ 
0.35 - 0.4 for M = 10 14 M o . For larger cluster halo mass 
M, the mass loss is somewhat smaller, which is due to 



10 



Lin, Mohr & Stanford 



the smaller concentration of the more massive halo. These 
estimates may well be lower limits, as the simulations sug- 
gest (Hayashi et al. 2003). Estimating the amount of light 
that is stripped off is more problematic, because it involves 
knowledge of the relative distribution of light and mass 
within subhaloes. Nevertheless, this simple exercise shows 
tidal stripping may effectively make galaxies dimmer- if 
not completely destroy them- particularly in less massive 
clusters. 

Ram pressure stripping has been proposed to explain 
the possible transformation between normal spiral galaxies 
and SOs in intermediate redshift clusters (Poggianti et al. 
1999). Numerical studies show that, with a time scale of 
~ 10 7 yr, ram pressure stripping can deplete ~ 50% of 
gas from a disk galaxy (Abadi et al. 1999). However, this 
mechanism is most efficient in the central regions of the 
clusters, where the intracluster gas is densiest; it is there- 
fore important to check the spatial distribution of galaxies 
in order to assess the contribution of ram pressure strip- 
ping. Regardless, the changing of galaxy type is likely not 
relevant in explaining the systematic variation in the lumi- 
nosity function as we move from low mass to higher mass 
clusters. 

Tidal interactions during fast encounters between galax- 
ies or between a galaxy and a local substructure (galaxy 
harassment) is an effective way of transforming galaxies 
(Moore et al. 1998). As numerical studies show (Gnedin 
2003), tides are not necessarily strongest during core pene- 
tration. In fact, most of the strong tidal forces appear near 
encounters with local substructure (e.g. within infalling 
groups). As clusters continue to grow by accreting haloes, 
more and more galaxies get transformed (spirals become 
SOs, low surface brightness galaxies and dwarf spheroidals 
get destroyed and produce diffuse intracluster light, while 
ellipticals, which may be in place well before clusters form, 
are less affected). It is found that low surface brightness 
galaxies with luminosity near L* are destroyed most effi- 
ciently (Gnedin 2003). However, they probably only con- 
stitute a small proportion of the galaxy population, and 
galaxies of this sort are unlikely to appear in our 2MASS 
sample (Bell et al. 2003); therefore, their destruction could 
not account for the slope of the HOD in our analysis. 

Another way of altering the halo occupation number 
above some limiting absolute magnitude is to alter the 
luminosity of galaxies, moving them into or out of the 
sample. Varying star formation efficiency across cluster 
mass scales acts in the right direction to reduce the num- 
ber of galaxies in higher mass clusters (e.g. Cole et al. 
2000; Springel & Hernquist 2003). As for stellar ageing, 
studies indicate that stellar ages in high and low mass clus- 
ters do not differ much (e.g. Stanford et al. 1998). Using 
ageing effects to explain our if-band results is especially 
difficult, because of the reduced sensitivity to young stars. 
The NIR stellar mass-to-light ratio varies only a factor 
of 2 over a Hubble time (Madau et al. 1998). We em- 
phasize, though, that neither of these physical processes 
are at work in the numerical study of the halo occupation 
distribution where a similarly, flat N-M relation is seen 
(Kravtsov et al. 2003). Thus it is clear that the galaxy 
transformation we observe can be explained without re- 
sorting to these processes. 

As for the bright end of the LF, the appearance of bright 



galaxies in high mass clusters may be the result of merging; 
some of the bright galaxies may be merger remnants oc- 
curring in lower mass haloes, and some may be the BCGs 
in less massive clusters that have recently merged with 
the main cluster. Because tidal stripping is very effective, 
producing very bright galaxies from what were previously 
BCGs in subclusters may require merging with other less 
luminous galaxies during cluster infall and subsequent viri- 
alization. 

In summary, the transformation of the LFs may well in- 
volve several mechanisms. In the next section, we try to 
further differentiate these processes by inspection of the 
galaxy spatial distribution. 

5.2. Constraints from Galaxy Distribution 

One way of flattening the slope of the N-M relation 
would be if galaxy distributions in high mass clusters are 
more extended (with respect to the virial radius) than in 
low mass clusters. We check this possibility by examining 
the galaxy distributions in low and high mass clusters. 
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- Top: Comparison of profiles for galaxies brighter and 
fainter than Mji = —23.5 in all clusters. The best-fit profiles have 
c g = 3.0±0.3 (bright) & 2.8±0.3 (faint). Bottom: Galaxy distribu- 
tion in the most and least massive 25 clusters (the mean masses are 
shown in the figure). The galaxy radial distributions are consistent 
with each other. The two profiles are fit with c g = 2.8 ± 0.3 (high 
mass) & 2.9 ±0.5 (low mass). We adjust the relative normalization 
of the profiles for ease of comparison. 

In Fig 8 (lower panel) we show the profiles for stacked 
clusters built from the 25 most and least massive clusters 
(again excluding their BCGs). The normalization of the 
profiles is arbitrarily chosen so the two profiles are clearly 
displayed. These two radial distributions are statistically 
consistent, indicating that groups and clusters have similar 
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radial galaxy distributions. Fitting the NFW model to the 
two, the best-fit concentration parameters are 2. 84j^32 & 



2 BQ+0.55 



-0 52' f° r high mass and low mass clusters, respec- 
tively. Therefore, any variation of galaxy concentration 
with respect to cluster mass must be weak. 

Our finding indicates that, whatever the mechanisms 
that are responsible for the galaxy population transforma- 
tion, they have to operate in such a way that the galaxy 
radial distribution is unaffected. This suggests that mech- 
anisms whose efficiency strongly depends on the typical 
clustercentric distance of a galaxy are not preferred. Thus, 
it may be that ram pressure stripping and mean field tidal 
stripping are not major contributors to the population dif- 
ference in high and low mass clusters, unless the bulk of 
galaxies are on highly eccentric orbits that take them near 
the cluster center. 

By combining several high-resolution simulations, De 
Lucia et al. (2004) are able to address the subhalo radial 
distribution in parent haloes of different masses, under the 
effects of tidal stripping. They find that the (3D) subhalo 
number density is similar in host haloes of mass 10 14 and 
1O 15 M (their figure 6), which is in good agreement with 
our finding. However, they also find that more massive 
subhaloes tend to reside at larger radii from the center 
of host haloes, because of the efficiency of tidal stripping 
within the cluster environment. In Fig 8 (upper panel) 
we examine the distribution of bright and faint galaxies 
(with an arbitrary luminosity separation at Mk — —23.5) 
and find that the two galaxy populations have similar pro- 
jected radial profiles (the bright galaxy distribution has 
c g — 2. 99^0 25, while the faint galaxy distribution has 

c g = 2.75±o;ll)- We note that when the BCGs are in- 
cluded, the profile of bright galaxies becomes very cuspy 
and thus quite different from that of faint galaxies; how- 
ever, De Lucia et al. (2004) do not include the BCGs in 
their sample either (which are identified as the friend-of- 
friend groups themselves). If the mass of a subhalo has 
a simple correlation with the luminosity of the galaxy 
that resides inside, the simulation result appears to be 
inconsistent with our analysis. A further disagreement 
appears when comparing the galaxy distribution directly 
with that of subhaloes in numerical simulations (Diemand 
et al. 2004); subhaloes have a much less concentrated dis- 
tribution. 

The results in these two sections seem to pose some dif- 
ficulties for dynamical processes. Tidal interactions surely 
can cause mass loss in subhaloes, and may produce the ob- 
served galaxy profile (assuming there is an one-to-one cor- 
respondence between a subhalo and a galaxy). However, 
the majority of galaxies that need to be absent in high 
mass clusters (those fainter than M*) may not be easily 
destroyed by tides. In addition, the similar spatial distri- 
bution of galaxies (in high and low mass clusters and for 
bright and faint galaxies) provides some argument against 
tidal interaction, ram pressure stripping, and cannibalism 
by the central galaxy. 

5.3. Redshift Evolution of the HOD 

Comparison of present epoch groups and clusters may 
lead to a biased picture of the transformation of the galaxy 
population. The progenitors of the present day high mass 
clusters, namely the lower mass clusters at higher red- 



shifts, may not have the same properties as present day low 
mass systems. It may be that the moderate or low mass 
clusters at high redshift have systematically fewer galaxies 
per unit mass than their present epoch counterparts. In- 
terestingly high resolution TV-body numerical studies have 
shown just the opposite; the normalization of the halo oc- 
cupation number of subhaloes increases by a factor of 3 
from z = to z = 5 (Kravtsov et al. 2003). 

A clean way to examine this would be to trace the clus- 
ter LFs to high redshifts and compare clusters of roughly 
the same mass. An existing dataset makes this comparison 
possible (De Propris et al. 1999, a heterogeneous sample 
of 38 clusters from 0.1 < z < 1); each cluster in the sample 
is imaged in if s -band to about M*(z) + 2 mag (assuming 
passive evolution in galaxy luminosity). We collect the 
emission-weighted mean temperature Tx from literature 
for 20 of these clusters, and we re-scale the background- 
corrected number of galaxies to rsoo for the clusters with 
Tx information, rsoo is estimated by assuming the stan- 
dard evolution of the M-Tx relation (e.g. Mohr et al. 
2000). Using the best-fit values for M* at different cos- 
mic epochs, we estimate the number of galaxies to a depth 
two magnitudes below M* for each cluster in the high-z 
sample. We also calculate the halo occupation number for 
our local sample in the same way. By doing so we hope 
to reduce any systematic differences between the low and 
high redshift cluster samples. 




Fig. 9. — Comparison of halo occupation number for our local 
sample (solid points) and the distant clusters from the sample of De 
Propris et al. (1999, hollow points). We plot the ratio between the 
derived galaxy number and the best-fit N(M) for each cluster in the 
two samples. The best-fit relation is obtained from the local sam- 
ple with cutoff magnitude at Mx = —22, roughly two magnitudes 
fainter than M K „ (c.f. Eqn 6, §3.2). 

This exercise shows that the clusters at high redshift 
have a larger halo occupation number, compared to the 
clusters of similar mass in our low-z sample. The results 
are shown in Fig 9, where we plot the ratio of the total 
galaxy number to the expected number in a local cluster 
of the same mass (using the best-fit N-M relation of the 
nearby sample). As expected, the nearby clusters scat- 
ter about unity. However, the clusters at higher redshifts 



12 



Lin, Mohr & Stanford 



almost all have a larger number of galaxies. The result 
holds true when the comparison is made at a smaller ra- 
dius risoo, where less extrapolation for the high-z sample 
is needed. Although we attempt to match the analyses 
of the local 2MASS and high-z samples, systematic effects 
such as differences in star-galaxy separation in the two cat- 
alogs may still contribute to the differences in properties. 
However, if the high-z clusters do have a larger halo occu- 
pation number, our comparison then supports the finding 
of Kravtsov et al. (2003), and rules out the possibility that 
progenitors of present day high-mass clusters have smaller 
mean galaxy number density compared to that of the low 
mass clusters we see today. In other words, the redshift 
evolution of the HOD appears to go in the wrong direction 
to explain the flatness of the local N-M relation. Obser- 
vations have suggested a higher frequency of merger events 
in high redshift clusters (e.g. van Dokkum et al. 2000). It 
is therefore likely that the higher mean occupation num- 
ber found in high-z clusters is reduced over cosmic time as 
mergers take place. 

To improve our understanding of the evolution of the 
HOD, we have begun an observational program to improve 
the available data on the local and high-z samples. In addi- 
tion, a Spitzer Space Telescope GTO program will sample 
the rest-frame if -band light from distant clusters, which 
should provide more direct constraints on this aspect of 
cluster evolution. 

6. SYSTEMATIC UNCERTAINTIES 

Before we consider the implications of our findings on 
the hierarchical structure formation, it is important to ex- 
amine the robustness of our results. Here we investigate 
several factors that may potentially limit the interpreta- 
tion of our results. In the Appendix we further consider 
the effects of our cluster selection and background subtrac- 
tion method on the scaling relations. 

Galactic Contamination. To obtain a large sample, we 
choose objects 10 deg out of the Galactic plane; however, 
the possible confusion from stars may bias our results for 
clusters located at \b\ < 25° (e.g. Bell et al. 2003). We 
therefore rc-cxamine the scaling relations for the 78 clus- 
ters far from the Galactic plane (|6| > 25°). The results 
remain unchanged. 

Uncertainties in Cluster Radius and Center. Fitting the 
galaxy distribution requires knowledge of the virial radius 
and the cluster center for each cluster. To test the ro- 
bustness of our fit of the galaxy profile, we build mock 
clusters with characteristics similar to those in our sample 
(mass, redshift, halo occupation number, etc) and stack 
them, with uncertainties in the virial radius (up to 20%) 
and cluster center (up to 5') added. Without any uncer- 
tainties, our fitting routine gives a slightly positive biased 
result (Ac = Cfu — c input ~ 0.06 with scatter about 0.24). 
Uncertainties in the virial radius tend to cause a larger 
concentration than the true value (cfu > Ci nput ), while 
uncertainties in cluster center act in the opposite direc- 
tion. For example, a 10% uncertainty in the virial radius 
gives Ac <~ 0.26 with a scatter of 0.25, and a 1' uncertainty 
in cluster center results in Ac ~ —0.06 and a similar scat- 
ter. For probable uncertainties in the above two quantities 
(10 - 20% & - 1'), wc find that Ac - 0.15 - 0.35, with 
scatter about 0.25. Therefore, it is possible that the true 



galaxy distribution concentration is even smaller than 2.9 
found in §3.1. However, considering the fact that the LFs 
evaluated at r 50 o & r 2 oo also suggest c g <~ 3, we believe 
our result should not be too different from the true value. 

Shape of the Luminosity Function. The choice of a is im- 
portant, especially in deriving the total number of galaxies, 
and thus the halo occupation number. As we have shown 
in §3.2, the value wc have adopted is consistent with the 
stacked luminosity function from our cluster sample. We 
test the scaling relations with different choices of the faint- 
end slope: assuming a — —0.9 or —1.3 gives very similar 
(within la) slopes for light-mass correlation, while result- 
ing slopes of halo occupation number that are ~ 1.5ct away 
(and biased low and high, respectively) from the values 
presented for a = —1.1. 

Mass Temperature Relation. The M-Tx relation 
(Eqn 1) we adopt for estimating cluster mass from X-ray 
temperature is mostly suitable for systems more massive 
than M 500 = 3.57 x lO 13 /if o 1 M (Finoguenov et al. 2001). 
Although nearly all our clusters are more massive than 
this mass scale, it is important to consider the effect of a 
different M-Tx relation on the scaling relations found in 
this paper. Using a steeper M-Tx relation (e.g. the one 
with slope of 1.78 but a smaller normalization, see Table 
1 of Finoguenov et al. 2001) would result in slightly flat- 
ter L-M & N-M relations whose slopes are about 1 — 2a 
smaller than our best-fit values. 

Cluster Mass Dependence of Matter and Galaxy Dis- 
tribution. Numerical simulations suggest that the matter 
distribution concentration dm has a weak dependence on 
halo mass (Bullock et al. 2001). Because galaxies may 
generally follow the dark matter, it is also possible that 
c g is a function of cluster mass. Because r 500 is inferred 
directly from X-ray observations, while r 20 o is converted 
by assuming dm — 5, the relations evaluated at r 20 o may 
be affected by varying Cd m (M). Adopting the prescrip- 
tion for Cdm(M) given by Klypin et al. (1999) yields an 
almost identical L 2 oo _ -^20o relation. Systematic variation 
of c g (M) would come into our analysis through the con- 
version from the observed, "projected" quantities to those 
contained within the cluster spherical volume (see §3.1). 
As we show in §5.2, the data provide no indication of mass 
related trends in c g . 

These tests indicate that our results are not sensitive to 
the parameters or methods that we adopt. 

7. CONCLUSIONS 

We have analyzed a sample of 93 galaxy clusters 
whose virial masses range from 3 x 10 13 /if 1 M Q to 1.7 x 
1O 15 /iy O 1 M0. We have used the peak in X-ray surface 
brightness to define the cluster centers, and we have used 
the observed correlation between emission-weighted mean 
temperature and cluster virial mass (Finoguenov et al. 
2001) along with published cluster redshifts to estimate 
the scale of the cluster virial region. The following bulk 
properties of these systems are studied, with emphasis on 
any mass related trends: the composite cluster luminosity 
function, the galaxy distribution within the clusters, the 
correlation between total galaxy if -band luminosity and 
the cluster binding mass, and the halo occupation number. 

Our analysis shows that the luminosity function (ex- 
cluding the BCG) is well fit by a Schechter function with 
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a faint-end slope between —1.1 < a < —0.84, and charac- 
teristic magnitude and number density between —24.34 < 
M* - 5\ogh 70 < -24.02 and 3.01 tf Mpc" 3 < <j>* < 
4.43 /i 3 ^ Mpc -3 . A NIR survey of nearby clusters to a 
depth of Mk ^ — 19 would be useful in further constrain- 
ing the faint-end slope. The composite galaxy distribution 
(0.02 < r/r 20 o < 2) is found to be an NFW profile with 
concentration c g = 2.9 ± 0.2. This is somewhat smaller 
than the observed dark matter concentration, as well as 
that found in numerical studies. This implies that galax- 
ies have a more extended distribution than that of the dark 
matter. 

By solving the LF parameters L* and <f> t for each clus- 
ter, we obtain the total galaxy luminosity, as well as total 
galaxy number (§4) above some limiting absolute magni- 
tude Mk,Iow = —21. By comparing the galaxy NIR light- 
cluster mass correlation at different radii (rsoo & ?*20o) we 
find that the mass to light ratio decreases slightly with 
radius (as would be expected given the differing distribu- 
tions of galaxy and dark matter). The L-M correlation 
has a slope smaller than unity (0.69 ± 0.04, see Eqn 4, 
Table 1), which implies that cluster mass-to-light ratio is 
an increasing function of cluster mass. However, exclud- 
ing the BCGs significantly affects the normalization and 
the slope of the L-M correlation (but of course has only 
a minor effect on the halo occupation number). Without 
the BCGs, the total galaxy light roughly scales as the total 
galaxy number, which increases more rapidly with cluster 
mass (Eqn 6, slope is 0.84 ± 0.04; Table 1). The slope 
of the N-M correlation agrees well with the prediction 
of numerical simulations. In addition, we find the distri- 
bution of the halo occupation number about the mean is 
Poissonian (Figure 6). 

This flatness of the N-M relation in our local clus- 
ter sample indicates that higher mass clusters have fewer 
galaxies per unit mass than low mass clusters. We exam- 
ine the occupation number of higher redshift clusters using 
published NIR and X-ray data; in crude agreement with 
numerical simulations, the higher redshift clusters have 
more galaxies per unit mass than nearby clusters. As 
massive clusters accrete lower mass systems, there must 
be processes at work that alter the galaxy populations, 
either reducing galaxy masses or perhaps even destroying 
some galaxies. Through further examination of our lo- 
cal cluster sample we find that the radial distribution of 
galaxies appears to be independent of cluster mass; in ad- 
dition the ultra bright end of the luminosity function is 
simply not present within low mass clusters and groups, 
and the number density of galaxies fainter than about M* 
is higher in low mass clusters than in high mass clusters. 
Thus, the transformation of the galaxy population from 
low mass to high mass clusters appears to be a building 
up of a few extremely luminous galaxies and the destruc- 
tion or faintcning of galaxies fainter than M*. We note 
that there does not appear to be enough light in missing 
galaxies in high mass clusters to account for the increases 
in the bright end of the luminosity function. 

We investigate possible mechanisms that shape the slope 
of the halo occupation N-M correlation (§5). We find that 
none of the mechanisms we consider can by itself offer a 



satisfactory explanation of the galaxy population differ- 
ences between high and low mass clusters. Among these, 
tidal interactions (stripping, galaxy harassment) could be 
quite important, but presumably merging must also play a 
role in the production of very luminous galaxies. Further 
theoretical progress together with additional observational 
constraints from extensions of studies like ours will pre- 
sumably lead to a clearer picture of the process of galaxy 
evolution within the cluster environment. 

One implication of the observed N-M relation in both 
our local sample and the high-z sample is that galaxies are 
destroyed or dramatically altered through various dynami- 
cal processes in the cluster environment. One thus expects 
that the amount of the diffuse intracluster light would ex- 
hibit some correlation with the cluster mass (e.g. Malu- 
muth & Richstone 1984). In a companion paper (paper 
III), we will study the relationship between the BCG lu- 
minosity and the cluster mass, using it to limit the amount 
of the intracluster diffuse light. This will provide further 
insights into the cluster formation scenarios and the evo- 
lution of cluster galaxies. 

An improved understanding of the redshift evolution of 
the HOD is critical for those who would use deep, NIR 
or optical cluster surveys to study cosmology. High yield 
cluster surveys must employ simple mass estimators such 
as the galaxy number or luminosity, because it would not 
be feasible to conduct detailed mass measurements using 
X-ray imaging spectroscopy, galaxy velocity dispersions 
or weak lensing. Our study provides a local calibration 
for these mass-observable relations. However, extracting 
cosmology from the observed cluster redshift distribution 
will require knowledge of how these relations evolve. Our 
comparison of the local sample to the published data on 
the high-z clusters suggests that cluster virial regions are 
better represented by galaxy number and light at high 
redshift than they are locally. Depending on how the 
non-cluster galaxies are evolving, this could boost opti- 
cal and NIR cluster finding at high redshift. It remains to 
be seen whether the recently discussed cluster survey self- 
calibration approach (Majumdar & Mohr 2003; Hu 2003) 
will work with the higher scatter galaxy-based mass indi- 
cators. 
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APPENDIX 

Here we provide some tests that help assess the robustness of the scaling relation between the cluster light and mass 
presented in this paper. We show below that the use of the statistical background subtraction method is adequate and 
does not cause noticeable mass-related trends. Further we show the use of a heterogeneous cluster sample does not bias 
the L-M and N-M scaling relations. 

Statistical Background Subtraction. We first probe for mass related biases in the statistical background subtraction 
by examining the fraction of the observed flux that comes from the background. This may be a concern since for more 
massive clusters (which tend to be more distant) the total light derived may be more sensitive to the reliability of the 
statistical background method. Fig Al (left panel) shows there is no obvious trend related to cluster mass. Typically 16% 
of the observed flux is subtracted as background flux. For the 78 clusters that do not require large statistical background 
flux subtraction (fb gn / fobs < 0.23), we find the slope of the L-M relation remains the same. 

Furthermore, a comparison of the background galaxy counts estimated by the statistical background method and by 
the "annulus" background method can reveal how secure the former method is against field-to-field variations. Under 
the assumption that cluster galaxies are distributed according to an NFW profile with c g = 3, we solve for the constant 
background Yib,a and the overall normalization of the NFW profile by the total galaxy counts in cluster virial region and 
within a large annulus around the cluster (c.f. §2.2.1 in paper I). We show the resultant "annulus" background surface 
density in Fig Al (right panel). Given the large amplitude of the angular correlation function of the 2MASS galaxies 
(Mailer et al. 2003), which in turn implies a non-negligible contribution of background galaxy clustering (similar or larger 
than the Poisson error, e.g. Peebles 1980; Blake & Wall 2002), the large field-to-field variation is not surprising. However, 
we note that the weighted (unweighted) average T,b, a — 12.0 ± 0.5 (15.3 ± 1.1) deg -2 is in reasonable agreement with the 
expectation of Y, b , s = 14.4±0.3 deg -2 , obtained by integrating the log TV-log S to the 2MASS completeness limit K = 13.5 
(shown in the figure as a horizontal dashed line). Note that the uncertainty in T,b. s only accounts for the uncertainty in 
the logA^-logS" relation. It is interesting that for two clusters, Eh. a < 0, which probably indicates that an NFW profile 
does not describe the galaxy distribution well. Finally, we note that the clusters are sorted in an order of decreasing mass; 
it thus appears that employing the statistical background method does not introduce any mass trends. 

Cluster Selection. As we point out previously, our cluster sample is basically X-ray-selected; more massive clusters tend 
to be at higher redshifts, and therefore any redshift-rclated systematics may be interpreted as a mass trend. Following 
paper I, we divide our clusters into "hot" and "cold" subsamples, with kTx — 3.8 keV as the dividing point. The mean 
redshifts for the hot and cold subsamples are 0.056 and 0.031, respectively. To investigate if any redshift trend exists, 
we choose z = 0.043 to separates clusters into "nearby" and "distant" categories. In Fig A2 we show the LF parameters 
M* and 0*. The filled pentagons represent hot clusters, while open circles denote cold clusters. Furthermore, the relative 
sizes of the symbols indicate whether they are nearby or distant. Dividing the whole sample into distant and nearby 
subsamples, a two-dimensional Kolmogorov-Smirnov test indicates that the two subsamples are highly likely to be drawn 
from the same parent distribution. Within each temperature group, there is also no difference between M* and for 
distant and nearby clusters. 

Because 2MASS is a flux-limited survey, only the brightest galaxies can be observed in more distant clusters. Therefore 
deriving the LF for these distant- approaching z ~ 0.1- clusters (especially when they are not massive) requires some 
extrapolation. We have examined the scaling relations by excluding those clusters whose estimated total light is more than 
3 times the total observed light. The changes in the slopes of the scaling relations are within the statistical uncertainties. 
Finally, we consider a subsample of 49 clusters that are within HIFLUGCS (Reiprich & Bohringer 2002), which is a X-ray 




Fig. Al. — Left: ratio of the estimated background flux to the total observed flux, as a function of cluster mass. Right: comparison between 
the "annulus" background surface densities (points) and that estimated from the all-sky log JV-logS relation (dashed line, Ej, s = 14.4 rfc 0.3 
deg -2 ). The solid line is the weighted average value of the points (>!(, a = 12.0 ± 0.5 deg -2 ). 
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flux-limited sample. The scaling relations based on this subsample show slight increases in slope, but all within la from 
those listed in Table 1. 
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156 ± 32 


a2420 





0846 


6?i 

5.8* 




8 


4.86 ± 1.61 


-24.46 ± 


75 


4.71 ± 0.10 


86 ± 32 


8.40 ± 0.06 


241 ± 140 


a0119 





0440 


36 


40 


4.13 ± 0.56 


-25.03 ± 


34 


6.46 ± 0.07 


91 ± 16 


10.10 ± 0.08 


133 ± 21 


a3391 





0531 


5.7 
5.7 


® 


23 


3.50 ± 0.61 


-24.94 ± 


44 


5.84 ± 0.09 


72 ± 16 


9.29 ± 0.07 


132 ± 25 


a3558 





0480 


60 


6.73 ± 0.68 


-25.13 ± 


26 


11.10 ± 0.06 


147 ± 20 


17.98 ± 0.05 


260 ± 28 


a3158 





0590 


5.5 


31 


7.70 ± 0.98 


-24.57 ± 


35 


6.76 ± 0.05 


129 ± 22 


9.13 ± 0.05 


169 ± 30 


al991 





0590 


5.4° 


§ 

'?. 206 

I:, 1 .? 1 


17 


4.48 ± 0.90 


-24.59 ± 


50 


4.06 ± 0.07 


73 ± 19 


5.47 ± 0.06 


108 ± 32 


a2065 





0726 


5.37 


23 


11.20 ± 1.41 


-24.44 ± 


38 


7.33 ± 0.05 


167 ± 28 


12.40 ± 0.03 


329 ± 60 


al795 





0631 


5.34 


17 


9.76 ± 1.90 


-23.99 ± 


48 


4.80 ± 0.04 


123 ± 29 


6.25 ± 0.04 


182 ± 58 


a3822 





0760 


5.12 


8:8? 
8:ig 7 

0.42 


16 


4.97 ± 0.86 


-25.04 ± 


48 


5.38 ± 0.10 


85 ± 19 


9.36 ± 0.07 


165 ± 34 


a2734 





0620 


5.07 


11 


3.53 ± 0.94 


-24.60 ± 


64 


3.13 ± 0.09 


53 ± 17 


4.11 ± 0.07 


89 ± 36 


a3395sw 





0510 


r0.3 


30 


5.77 ± 0.81 


-24.80 ± 


37 


5.61 ± 0.06 


92 ± 17 


7.98 ± 0.06 


134 ± 24 


a0376 





0484 




29 


7.59 ± 1.10 


-24.30 ± 


37 


4.57 ± 0.04 


100 ± 19 


6.49 ± 0.03 


171 ± 33 


al314 





0335 


r4.5 

°1.8 




22 


2.53 ± 0.64 


-24.69 ± 


58 


2.48 ± 0.09 


40 ± 13 


2.60 ± 0.16 


38 ± 20 


a2147 





0351 


4.91 


0.18 


42 


5.76 ± 0.80 


-24.50 ± 


35 


4.15 ± 0.05 


81 ± 15 


7.58 ± 0.04 


150 ± 22 


a3112 





0750 


4.7° 


hi* 

¥ 
I 


8 


6.00 ± 1.73 


-24.38 ± 


71 


4.11 ± 0.08 


71 ± 24 


6.44 ± 0.06 


150 ± 57 


a 1644 





0474 


4 "fi 


34 


7.96 ± 1.03 


-24.51 ± 


34 


5.82 ± 0.05 


103 ± 18 


7.91 ± 0.04 


150 ± 25 


a2199 





0303 


4.5H 


52 


7.35 ± 0.93 


-24.36 ± 


32 


4.43 ± 0.04 


86 ± 15 


6.88 ± 0.04 


138 ± 20 


a2107 





0421 


4.31 


l}.57 
0.35 


23 


5.54 ± 1.03 


-24.36 ± 


46 


3.26 ± 0.05 


60 ± 15 


4.38 ± 0.07 


76 ± 19 


a0193 





0486 


4.2* 


a 

?8 


17 


5.45 ± 1.13 


-24.39 ± 


52 


3.16 ± 0.06 


57 ± 15 


4.05 ± 0.05 


83 ± 24 


a2063 





0355 


4.1 


38 


8.36 ± 1.16 


-24.23 ± 


35 


3.50 ± 0.04 


79 ± 15 


4.53 ± 0.04 


107 ± 21 


a4059 





0475 


4.1 
4.1* 


14 


3.40 ± 0.83 


-24.77 ± 


60 


3.12 ± 0.10 


39 ± 12 


4.71 ± 0.07 


78 ± 20 


al767 





0701 


8 8 


11 


8.23 ± 1.77 


-24.38 ± 


58 


4.20 ± 0.07 


79 ± 21 


5.36 ± 0.07 


99 ± 31 


a0576 





0389 


4.02 


#07 
0.07 


35 


6.49 ± 0.92 


-24.79 ± 


36 


4.62 ± 0.06 


74 ± 14 


7.55 ± 0.04 


143 ± 21 


a3376 





0456 


4 0.4 

^0.4 


27 


9.51 ± 1.41 


-24.20 ± 


39 


3.81 ± 0.04 


85 ± 17 


5.02 ± 0.04 


117 ± 25 
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Fig. A2. — The LF parameters for individual clusters. Filled pentagons: clusters with kTx > 3.8 keV, open circles: those with kTx < 3.8 
keV. Larger symbols: clusters at 2 < 0.043, smaller symbols: those at 2 > 0.043. 
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Table 2 

Derived Quantities of the Clusters (Continued) 







T x 






0*,5OO 


M, - 51og/i 70 




-^500 




-^200 






Name 


z 


(koV) 


N obs 
(4) 


(h 3 70 Mpc- 3 ) 


(mag) 




(h m 2 1O 12 L ) 


N500 


(h- 2 10 12 


Lq) 


iV 2 oo 


(1) 


(2) 


(3) 




(5) 


(6) 






(7) 


(8) 


(9) 




(10) 


a0133 


0.0569 


3.97" 


1 ( 


15 


13.16 ± 2.63 


-23.81 ± 


51 


3 


72 ± 0.04 


96 ± 24 


5.35 ± 


03 


150 ± 40 


a0496 


0.0328 


3.91° 


11 

04 


37 


6.42 ± 0.94 


-24.62 ± 


37 


3 


91 ± 0.05 


66 ± 13 


5.47 ± 


04 


106 ± 18 


all85 


0.0325 


CO CO 

CD CD 
D O- 1 to 


28 


4.74 ± 0.87 


-24.63 ± 


45 


2 


84 ± 0.07 


49 ± 12 


4.13 ± 


06 


76 ± 17 


awm7 


0.0172 


2 

16 

22 

2 
2 


51 


4.96 ± 0.79 


-24.49 ± 


48 


2 


90 ± 0.06 


50 ± 10 


3.62 ± 


06 


74 ± 15 


a2440 


0.0904 


3.880 


5 


2.26 ± 0.75 


-25.64 ± 


88 


3 


35 ± 0.33 


30 ± 12 


4.51 ± 


27 


45 ± 19 


a3560 


0.0489 


3.87» 


24 


3.12 ± 0.56 


-26.17 ± 


44 


6 


50 ± 0.27 


50 ± 11 


9.52 ± 


18 


82 ± 15 


a0780 


0.0538 


3.8 : 


11 


24.14 ± 7.35 


-23.16 ± 


61 


3 


09 ± 0.02 


117 ± 37 


3.24 ± 


04 


90 ± 35 


a3562 


0.0499 






20 


5.74 ± 1.02 


-24.81 ± 


45 


3 


23 ± 0.08 


59 ± 14 


6.40 ± 


05 


118 ± 22 


a2670 


0.0762 


3.73^ 


17 
13 


13 


13.31 ± 2.18 


-24.41 ± 


51 


5 


74 ± 0.06 


111 ± 23 


8.72 ± 


05 


174 ± 36 


a2657 


0.0404 


3.7§:| 


17 


4.07 ± 0.90 


-24.73 ± 


55 


2 


35 ± 0.09 


40 ± 11 


2.84 ± 


09 


52 ± 16 


all42 


0.0349 


3-7} : 3 




13 


2.25 ± 0.70 


-24.88 ± 


77 


1 


84 ± 0.14 


24 ± 9 


2.71 ± 


16 


37 ± 14 


a2634 


0.0314 


, 7 o.i 


8 

A 


38 


5.62 ± 0.83 


-25.00 ± 


38 


4 


53 ± 0.08 


61 ± 12 


6.62 ± 


06 


104 ± 17 


2a0335 


0.0349 


3.64" 


054 
18 
OS 
08 


28 


5.81 ± 0.95 


-24.72 ± 


41 


3 


44 ± 0.07 


56 ± 12 


5.29 ± 


07 


86 ± 16 


a3526 


0.0114 


3.54 


59 


4.02 ± 0.71 


-25.12 ± 


67 


3 


66 ± 0.16 


44 ± 10 


5.24 ± 


18 


68 ± 14 


al367 


0.0216 


3 5 1 


I 

1 
2 

2 


60 


7.81 ± 0.99 


-24.62 ± 


34 


3 


81 ± 0.05 


69 ± 12 


5.46 ± 


06 


93 ± 15 


mkw03s 


0.0434 


3 5 8:! 
3 5 8:3 

J -°0.4 


13 


4.28 ± 1.08 


-24.53 ± 


62 


1 


96 ± 0.08 


36 ± 12 


2.76 ± 


06 


61 ± 19 


al736 


0.0458 




27 


5.84 ± 0.93 


-25.22 ± 


40 


5 


28 ± 0.10 


62 ± 13 


10.97 ± 0.08 


127 ± 18 


hcg094 


0.0417 


3.45" 


3 


12 


3.35 ± 0.94 


-24.68 ± 


69 


2 


51 ± 0.11 


29 ± 10 


4.37 ± 


12 


53 ± 14 


a2589 


0.0416 


3.38°, 


§8 


18 


6.31 ± 1.32 


-24.33 ± 


52 


2 


51 ± 0.06 


46 ± 13 


2.91 ± 


05 


65 ± 20 


mkw08 


0.0270 


3.29? 


SI 


24 


4.25 ± 0.91 


-24.64 ± 


56 


2 


13 ± 0.08 


34 ± 9 


2.69 ± 


09 


46 ± 13 


a4038 


0.0283 


3.15 


81 


34 


7.07 ± 1.13 


-24.58 ± 


41 


2 


85 ± 0.06 


52 ± 11 


3.55 ± 


06 


70 ± 15 


al060 


0.0114 


3 I ' 1 


8 6 


69 


7.18 ± 1.07 


-24.41 ± 


52 


2 


39 ± 0.06 


49 ± 10 


3.17 ± 


08 


68 ± 14 


a2052 


0.0348 


l\u 




30 


9.07 ± 1.39 


-24.41 ± 


39 


3 


22 ± 0.05 


60 ± 12 


4.43 ± 


04 


93 ± 18 


a0548e 


0.0395 


3 I 81 




30 


7.83 ± 1.18 


-24.94 ± 


38 


4 


17 ± 0.08 


63 ± 12 


6.70 ± 


06 


107 ± 17 


a2593 


0.0433 


3 




22 


7.22 ± 1.27 


-24.72 ± 


45 


3 


38 ± 0.07 


53 ± 12 


5.41 ± 


06 


90 ± 17 


a0539 


0.0288 


3.04" 


066 
06 


44 


9.95 ± 1.31 


-24.57 ± 


33 


3 


72 ± 0.05 


68 ± 12 


4.98 ± 


04 


101 ± 16 


asllOl 


0.0580 


ol.2 




8 


7.18 ± 2.01 


-24.28 ± 


72 


2 


18 ± 0.08 


41 ± 15 


3.16 ± 


05 


75 ± 27 


a0779 


0.0230 


2.97°, 


24 


15 


2.71 ± 0.85 


-24.69 ± 


88 


1 


72 ± 0.14 


19 ± 8 


2.25 ± 


19 


27 ± 11 


awm4 


0.0326 


2.92? 


11 
24 


12 


6.52 ± 2.03 


-23.63 ± 


71 


1 


48 ± 0.04 


28 ± 11 


1.73 ± 


04 


39 ± 18 


exo0422 


0.0390 


2 9 5 
2.9^ 


11 


4.63 ± 1.35 


-24.36 ± 


71 


1 


63 ± 0.08 


27 ± 10 


2.04 ± 


06 


42 ± 16 


a2626 


0.0553 


15 


8.85 ± 1.61 


-24.67 ± 


50 


3 


57 ± 0.08 


57 ± 14 


4.49 ± 


07 


77 ± 20 


zwl615 


0.0302 


2.9?? 




23 


7.49 ± 1.58 


-24.09 ± 


50 


1 


68 ± 0.05 


40 ± 11 


2.18 ± 


05 


53 ± 17 


mkw09 


0.0397 


2.662;" 


12 


7.71 ± 2.08 


-23.95 ± 


65 


1 


48 ± 0.05 


33 ± 12 


2.27 ± 


05 


54 ± 17 


a0194 


0.0180 


2.63°, 


15 
15 


24 


3.91 ± 0.97 


-24.84 ± 


73 


1 


97 ± 0.13 


24 ± 8 


2.85 ± 


31 


29 ± 10 


a0168 


0.0450 




21 


10.45 ± 1.74 


-24.60 ± 


44 


3 


17 ± 0.06 


55 ± 13 


4.25 ± 


05 


84 ± 18 


a0400 


0.0240 


2-43°°™ 


29 


7.82 ± 1.40 


-24.57 ± 


46 


2 


34 ± 0.06 


38 ± 9 


3.46 ± 


05 


65 ± 13 


a0262 


0.0161 


2.41° 


03 
():; 


39 


7.52 ± 1.33 


-24.53 ± 


51 


2 


07 ± 0.07 


36 ± 8 


3.03 ± 


06 


60 ± 12 


a2151 


0.0369 


2 4 0.06 
z -^0.06 


21 


6.04 ± 1.18 


-25.38 ± 


48 


3 


44 ± 0.14 


38 ± 9 


5.17 ± 


09 


70 ± 13 


mkw04s 


0.0283 


2.13° 


4 
4 


8 


2.86 ± 1.21 


-24.72 ± 1 


08 


1 


36 ± 0.17 


12 ± 6 


1.59 ± 


10 


22 ± 10 


a3389 


0.0265 


2 1°' E 
"^O.E 




25 


7.35 ± 1.42 


-25.12 ± 


48 


2 


81 ± 0.11 


35 ± 8 


4.09 ± 


08 


61 ± 12 


ivzw038 


0.0170 


2.07° 


56 

a? 


20 


4.37 ± 1.20 


-25.04 ± 


79 


1 


76 ± 0.17 


20 ± 7 


2.34 ± 


12 


35 ± 10 


as0636 


0.0116 


2.06? 


31 


5.09 ± 1.23 


-25.12 ± 


83 


1 


89 ± 0.19 


24 ± 7 


2.40 ± 


24 


32 ± 10 


a3581 


0.0230 


1.83° 


81 


15 


6.34 ± 1.71 


-24.51 ± 


70 


1 


22 ± 0.09 


20 ± 7 


1.52 ± 


16 


22 ± 8 


mkw04 


0.0200 


1.71 


83 
?§ 

16 


13 


4.01 ± 1.32 


-25.23 ± 


90 


1 


71 ± 0.23 


15 ± 6 


2.35 ± 


26 


22 ± 8 


ngc6338 


0.0282 


1.69° 


15 


9.07 ± 2.22 


-24.39 ± 


61 


1 


58 ± 0.07 


23 ± 7 


1.69 ± 


07 


29 ± 10 


a0076 


0.0405 


1 


15 


11.29 ± 2.42 


-25.14 ± 


53 


2 


03 ± 0.13 


30 ± 8 


3.56 ± 


06 


58 ± 13 


ngc6329 


0.0276 


1.45° 


08 
08 


13 


9.46 ± 2.49 


-24.47 ± 


66 


1 


26 ± 0.08 


20 ± 7 


1.56 ± 


09 


26 ± 9 


as0805 


0.0140 




28 


13.67 ± 2.85 


-24.07 ± 


57 


1 


30 ± 0.05 


23 ± 6 


1.75 ± 


09 


30 ± 9 


ngc0507 


0.0165 


1.26° 


07 


21 


9.39 ± 2.23 


-25.18 ± 


63 


1 


94 ± 0.16 


21 ± 6 


2.57 ± 


18 


28 ± 8 


ngc2563 


0.0163 


1.06° 


81 

04 


13 


10.50 ± 3.33 


-24.18 ± 


85 





75 ± 0.08 


12 ± 5 


1.08 ± 


08 


21 ± 7 


wp23 


0.0087 


i 0.6 
°' 4 n 


7 


2.71 ± 2.71 


-26.01 ± 2 


83 





89 ± 0.89 


6 ± 6 


1.09 ± 


93 


10 ± 10 


ic4296 


0.0133 


0.95n 


09 


14 


11.41 ± 3.66 


-24.25 ± 


90 


1 


25 ± 0.09 


12 ± 5 


1.63 ± 


13 


18 ± 7 


hcg062 


0.0137 


0.87°, ° 2 


8 


7.15 ± 3.42 


-24.43 ± 1 


37 





59 ± 0.17 


7±4 


0.92 ± 


23 


12 ± 6 



Note. — Columns: (1) Name; (2) Redshift; (3) Emission-weighted mean temperature; (4) Estimated number of member galaxies observed; 
(5) Characteristic number density; (6) Characteristic magnitude; (7) Total luminosity (with BCG) within rsoo brighter than = —21; (8) 
Total number (with BCG) of galaxies within rsoo brighter than Mk = —21; (9) & (10) Same as (7) & (8), but within r2oo- Uncertainties in 
all columns quoted at Icr level. M* & </>* calculated assuming a = —1.1. 
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